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Abstract 

The Hamiltonian structure of general relativity provides a natural 
canonical measure on the space of all classical universes, i.e., the mul- 
tiverse. We review this construction and show how one can visualize the 
measure in terms of a "magnetic flux" of solutions through phase space. 
Previous studies identified a divergence in the measure, which we ob- 
serve to be due to the dilatation invariance of flat FRW universes. We 
show that the divergence is removed if we identify universes which are so 
flat they cannot be observationally distinguished. The resulting measure 
is independent of time and of the choice of coordinates on the space of 
flelds. We further show that, for some quantities of interest, the measure 
is very insensitive to the details of how the identification is made. One 
such quantity is the probability of infiation in simple scalar field models. 
We find that, according to our implementation of the canonical measure, 
the probability for A'^ e-folds of inflation in single-field, slow-roll models is 
suppressed by of order exp(— 3A'^) and we discuss the implications of this 
result. 



1 Introduction 



The problem of comparing different possible histories of the universe, and as- 
signing a probability to each, is central to theoretical cosmology. We cannot 
expect a fundamental theory to predict precisely what we see today: at best, 
it should predict an ensemble or "multiverse" of possible universes, with the 
universe we observe being a typical member. 

Of course there are many attitudes that one may take to probability not only 
in the context of cosmology, but in science more generally. The stance adopted 
in this paper is that in practising science we often adopt Bayesian methods 
which make essential use of a priori probabilities (even if one does not accept 
this, or has doubts about what these probabilities mean, this is certainly a fair 
characterisation of much of the current observational literature) . 
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In fact, following Laplace's Principle of Indifference [T], we often start with 
the least informative probability distribution, i.e., a flat distribution, and then 
sharpen it in the light of additional information. This is, as we see it, the essence 
of the Bayesian approach. It requires a well defined a priori measure, i.e., one 
with finite total measure. 

At the risk of falling into pedantry let us spell this out more formally. We 
define the a priori probability P(U) to be the probability that the universe is 
U, according to some fundamental theory. Likewise we define P{0) to be the 
probability of making an observation O in any universe. The joint probability 
that the universe is U and that we make an observation O is P{U O). The 
conditional probability of making an observation O in a universe U is P{0\U) 
(called the likelihood) , and the conditional probability that we are in the universe 
U given that we have made an observation O is P(U\0) (called the a posteriori 
probability). It follows from elementary considerations that 

p{u\o)p{0) ^ p{ur\0) ^ p{o\u)P{u) (1) 

whence the Cosmic Bayes's Theorem P] tells us that 

_ PiO\U)P{U) 

where the integral is over what we call the multiverse, M , and dU is a measure 
on the multiverse. Equation ^ describes how the a priori probability P(U) is 
updated in the light of observations. 

In this paper, we define the multiverse M to be the set of all possible uni- 
verses, or, better, the set of all model universes. These are by definition dis- 
connected from one another. This differs from other popular interpretations in 
which the multiverse is a connected spacetime containing many causally discon- 
nected regions and, roughly speaking, probabilities are taken to be proportional 
to the numbers of such regions. This latter concept has been termed the metau- 
niverse [31 |4j . It is not obvious to us that this second meaning can be made 
quantitively precise in such a way that the probabilities are well defined. The 
basic problem is that the natural measure on spacetime, J d'^x^—g, is usually 
infinite in the scenarios being considered, and the infinity is not removed by re- 
stricting attention to so-called reheating surfaces. Much of the current literature 
on inflation appears to reflect this difficulty. 

By contrast to Laplaces's Principle, one may regard a Proposal for the State 
of The Universe as some choice of probability distribution function P{U) relative 
to the a priori measure dU . It might possibly constitute an "explanation" if 
derived from some underlying fundamental theory which, again, must be well 
defined to be meaningful. Note that in this paper we are not setting out to 
make a specific proposal for the universe, i.e., we are not advocating a specific 
formula for P{U), but rather, as we shall argue in detail later, we are adopting 
Laplace's Principle of Indifference and finding that indeed this is inadequate to 
favour inflation. A more specific, sharper probability distribution is required to 
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explain inflation. One may quantify the sharpness of a probabiHty distribution 
by defining (following von Neumann, Shannon and others) the information 



which, after imposing the constraint J P{U)dU — 1, is minimised by the uniform 
distribution P(U) =constant, as one can easily check by computing the first and 
second variational derivatives and using P{U) > 0. The quantity / measures the 
information (or lack of randomness) carried in a typical realisation of P{U): the 
least information is obtained if all possible outcomes are equally likely, and the 
greatest if a single outcome is certain. The information entropy S is then defined 
to be the negative of /, which is of course maximised in the uniform distribution. 
As is well-known, maximising S under various additional constraints including 
the conservation of energy yields the usual formulae of statistical mechanics. 

In this paper, we show that, with the canonical definition of dU given by 
Hamiltonian dynamics, Laplace's principle does not typically predict inflation. 
Hence, even if a fundamental theory allows inflation, a sharper P{U)is required 
in order to explain why inflation actually occurred. It remains as a challenge to 
fundamental theory to explain how such a P{U) might arise. The predictiveness 
of any particular proposal can be assessed from ([3]); what we would be most 
happy with is a highly predictive theory, whose predictions were consistent with 
observation. In fact this idea can be taken further, as we have been reminded 
by Don Page. If we regard a proposal P{U) as a hypothesis then we may, given 
an appropriate measure on the infinite dimensional set of hypotheses {P{U)}, 
adopt Bayesian methods to evaluate their relative probabilities. We shall not 
attempt this formidable task in this paper. 

The ideas sketched above are not new, and many people have pursued them 
before in some, largely qualititive, way. In this paper we attempt , following [5] , 
to make them quantitative. In fact our own viewpoint was influenced in part by 
trying to make some of Penrose's arguments [3 [71 [H| mathematically precise. 

There are several additional problems peculiar to cosmology when attempt- 
ing to construct a statistical theory. First, there there is the problem of general 
covariance. There is no absolute notion of space or time in general relativity: 
these are properties of each particular classical solution of the field equations. 
Second, the solutions generically possess singularities in the past or the future, 
where the field equations break down in finite time. Even if one makes the 
drastic simplification of restricting attention to Friedmann-Robertson- Walker 
(FRW) universes, i.e., so-called mini-superspace, it is not obvious how to com- 
pare different classical spacetimes which are solutions to the same laws of 
physics. Finally, in theories like string or M-theory, where the low energy ef- 
fective description involves many additional fields, there may not even be a 
preferred space-time metric: one can change coordinates on the space of fields 
(for example via Weyl transformations) in an arbitrary way and physical results 
should not depend on these choices. 

In spite of these difficulties, the problem of constructing sensible measures 
on the space of solutions is of undeniable importance to the evaluation of various 
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cosmological scenarios. Most of these scenarios focus on some subset of classical 
solutions, for example arguing that they are "generic" because they exhibit 
dynamical attractor behavior. But to understand how predictive the proposals 
really are, in the absence of any further information, we need to quantify the 
extent to which the flat probability measure, without any additional input, 
succeeds in narrowing down the range of allowed classical solutions. 

In recent times, the problem of the measure in cosmology has grown hugely 
in importance. The claimed vast "landscape" of possible compactifications and 
moduli in string theory seems to offer a truly bewildering range of classical 
cosmologies [5]. When combined with recent observations of dark energy, the 
landscape picture has encouraged many physicists to pursue anthropic explana- 
tions. However, the lack of an a priori measure on the space of model universes 
is a serious flaw in such attempts, as even the most ardent proponents will ad- 
mit. Nevertheless, remarkably little attention has been devoted to constructing 
measures whereby the different possible classical spacetimes within the "multi- 
verse" can be compared [TSKTH]. In a similar vein to the anthropic arguments, 
a "top down" approach to cosmology has been proposed by Hawking and Her- 
tog [13], in which one imposes constraints on the state of the universe today 
and attempts to determine its most probable history, for example by constrain- 
ing the initial state to satisfy Hartle and Hawking's no-boundary proposal [13]. 
However, it is hard to judge the success or otherwise of this approach without 
being able to quantify the extent to which it narrows down the multiverse. 

Nearly two decades ago. Gibbons, Hawking and Stewart identified a nat- 
ural measure on the set of classical cosmological solutions. They argued that 
any such measure should satisfy the following requirements: (i) it should be 
positive, (ii) it should depend only on the intrinsic dynamics and neither on any 
choice of time slicing nor on the choice of dependent variables, and (iii) it should 
respect all the symmetries of the space of solutions without introducing any ad- 
ditional ad hoc structures (e.g. "Planck mass cutoffs" ) not arising from the field 
equations themselves. They showed that a measure satisfying all these require- 
ments arises naturally from the Hamiltonian structure of general relativity and 
that it can be used to count the number of different solutions of the classical 
field equations in a well-defined way. Note that the set of all Cauchy data is 
unsuitable for this purpose because different Cauchy data can yield the same 
classical solution, evaluated on a different time-slice. The proposed measure 
avoids this overcounting problem by counting each distinct classical solution, in 
its entirety, only once. 

One obvious application of such a measure on the multiverse is in determin- 
ing the probability of inflation, i.e., how likely is it that the universe started 
out in an inflationary state, within different models of the laws of fundamental 
physics. This question is potentially an Achilles' heel for inflation: if inflation 
is itself highly improbable, then it cannot be claimed to solve the classic cos- 
mological fine tuning puzzles, of large-scale homogeneity, isotropy and flatness. 
Initially, it seemed that the canonical measure did indeed favor inflation [5] , in 
simple inflationary models, but more detailed investigations, in particular by 
Hawking and Page [12], identified a serious ambiguity. They found that the 



4 



canonical measure is infinite, and that both inflationary and non-inflationary 
solutions typically have inflnite measure. Unfortunately, this left the problem 
of whether inflation is likely, or unlikely, unresolved. Since then, the problem 
has not received much attention, although various intuitive arguments, gener- 
ally not satisfying the three above-mentioned conditions, have been given for or 
against inflation. 

In this paper, we shall revisit the canonical measure, showing that when 
treated with sufficient care, a finite measure, still satisfying the Gibbons-Hawking- 
Stewart conditions, can be obtained. In agreement with Hawking and Page, we 
find the canonical measure is infinite. However, we identify the divergence as 
being due to the dilatation symmetry of fiat FRW universes, and we show it is re- 
moved if one identifies solutions which cannot be observationally distinguished. 
Concretely, we impose a limit on the parameter known in the observational 
literature as flk, measuring the ratio of the space curvature term in the Fried- 
mann equation to the square of the Hubble parameter. If all universes with \ flk\ 
smaller than some observational limit Aflk <C 1 are identified, then the canon- 
ical measure becomes well-defined. Furthermore, provided the limit on |r2fc| is 
imposed once the matter fields have entered a phase of evolution in which the 
expansion of the universe acts adiabatically, i.e., as a slow variation of parame- 
ters in the matter Lagrangian, our measure reduces to the canonical measure on 
the matter fields alone, and is an adiabatic invariant. In this regime the mea- 
sure becomes independent of the conjugate "angle" variable. This statement is 
independent of the time, the values of the cosmological parameters, or indeed 
the value of Afifc when the cutoff is imposed. These good properties motivate 
us to reconsider the probability of inflation with various numbers of e-foldings, 
using the proposed canonical measure. The probability of N e-folds of inflation 
depends critically on the "angle" variable, and we find that the probability of 
obtaining N e-folds of infiation in simple infiationary models is suppressed by a 
factor of exp(— 3iV). Since > 50 — 60 is typically required in realistic models 
(depending on the efliciency of reheating), it follows that inflationary solutions 
are, in fact, tremendously rare in the space of classical solutions. We emphasize 
these statements are independent of the details of the cutoff imposed on the 
canonical measure. 

Our conclusion is, of course, at variance with much of the inflationary litera- 
ture over the last two decades. It has some resonance with the well-known argu- 
ments (see e.g. Refs. |l5l[6j[7l[8]) that inflation cannot possibly solve the classical 
cosmological puzzles for arbitrary initial conditions because any physically per- 
missible current state of the universe would, if run back in time, correspond to 
some initial conditions, and also with Holland and Wald's arguments [16] that 
any canonical measure would naturally assign the same probability to deflation 
as to inflation, since Einstein's equations are time-reversal invariant. Because 
the canonical measure is a measure on the set of universes, independent of their 
time-orientation, we do indeed find this result: both inflation and deflation are 
predicted to be exponentially rare phenomena among the set of all classical 
solutions of the field equations. 

The structure of this paper is as follows. In Section 2 we review the construc- 
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tion of the canonical measure, clarifying the geometrical structure in elementary 
terms and in particular demonstrating that the measure is independent of the 
choice of initial slice in the space of dynamical variables (contrary to the appar- 
ent assertion of Hollands and Wald on this matter ^TE' ). In Section 3, we discuss 
the classical dynamics of simple scalar field models and we compute the canon- 
ical measure for scalar field matter in FRW spacetimes with arbitrary spatial 
curvature. At face value, the measure exhibits a divergence at large scale factor, 
i.e., in the flat space limit, but we argue that since the scale factor is unobserv- 
able in this limit, one must factor out dilatations from the result. We show 
that this results in a sensible measure on the space of classical solutions, i.e., 
one of finite total measure. In Section 4 we solve the classical field equations, 
identifying the solutions undergoing N e-folds of inflation and computing the 
associated measure. Section 5 compares our results with those obtained in pre- 
vious discussions, and explains why our conclusions are so different from those 
reached on the basis of intuitive reasoning from "chaos at the Planck density" . 
Section 6 concludes. 



2 Liouville measures and magnetic flux 

The principles behind the canonical measure are extremely simple. If one re- 
stricts attention to consistent (in the technical sense) finite-dimensional trun- 
cations, so-called mini-superspace models, then we are faced with a standard 
problem in Hamiltonian mechanics, and it is natural to bring to bear on the 
problem the standard techniques of statistical mechanics which have been used 
so successfully in all other areas of physics. 

The Einstein-matter equations provide a Hamiltonian flow in a 2n-dimensional 
phase space P, equipped with a symplectic form lu which may be written in local 
Darboux coordinates as 

71 

= ^ dpi A dq^ . (4) 
The n'th power of u gives the Liouville volume element on P 

/ -|yi(n-l)/2 

L- tj" = d"p (f'q . (5) 

n! 

However, this is not what we want. We want to describe the set of distinct 
dynamical trajectories or, cquivalently, the set of classical initial conditions 
giving distinct histories. In general relativity, the Hamiltonian 7i is constrained 
to vanish so the trajectories all lie on a 2n—l dimensional constraint submanifold 

c = n-\o). (6) 

The set of classical trajectories is the Marsden-Weinstein quotient, also some- 
times called the reduced phase space. This is the multiverse, 

M = C/R = H-\0)/E., (7) 
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where M is the Hamiltonian flow. As Gibbons, Hawking and Stewart showed [S] 
(see also [10]), a symplectic form on the multiverse M is naturaUy inherited 
from uj: one simply chooses local coordinates in which p„ = 7i, from which 
Hamilton's equations imply that q" = t, the time. As a result, one has 

ri-l 

uj^^dp^h dq' + dnA dt, . (8) 

1=1 

so that the restriction to the constraint surfaces, Ti — Q, naturally yields a two- 
form on the space transverse to the Hamiltonian flow, ojc = oj\u=a- We now 
construct a measure on M by raising ujc to the [n — l)'th power: 



C_iy"-l)("-2)/2 
V , .n — 1 

(n- 1)! 



^ ' \^ u:l-\ (9) 



Gibbons, Hawking and Stewart showed that the flux obtained by integrating 
this form is positive if measured in the correct sense, independent of slicing 
in phase space and invariant under Hamiltonian flow. We shall review this 
argument in more elementary terms below. Furthermore, ^ is natural in the 
sense that it requires no new elements in the theory other than those already 
present in the classical equations of motion. By its construction, it is invariant 
under any additional canonical symmetries. Hence, conditions (i)-(iii) given in 
the introduction are satisfied. 

One can visualize the measure © as the "flux" of a divergence-free "mag- 
netic field" . In general coordinates on phase space, the symplectic form is a 
covariant second rank anti-symmetric tensor field with components satisfying 

= -uj^f,, (10) 

where /z, = 1 . . . 2n, and detw ^ 0. The symplectic form is closed, 

dij = 0, (11) 

which implies 

^[^W.r] = 0, (12) 

and Hamilton's equations are 

V^' = uJ^"'^,n, (13) 

with the summation convention, where oj^'^ is the inverse of cj^^ and the ve- 
locity on phase space, = (dx^/dt). Alternately, we may rewrite Hamilton's 
equations as 

Lo^,,v = d^n, (14) 

from which one obtains 

vf'd^n = 0, (15) 

so the flow lies in the surfaces H — constant. 
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Now let us choose coordinates such that 



x^" = n. (16) 

The closure condition, restricted to "spatial" indices, corresponding to directions 
tangent to the constraint manifold, is 

d[a^bc] = 0, (17) 

where a,b = 1, . . . , 2n — 1, and we have 

= 0, (18) 



since the Hamiltonian has no explicit time-dependence. Thus from ([T4|), and 
the fact that all "spatial" derivatives of H are zero, we have V^ojab = 0. 

It is simplest to see what this means in the example n = 2, for which a,b,c 
run from 1 to 3. Define a magnetic field by 

Ba = -^abc^^bc, (19) 

then because uj is closed, B is divergence-free, 

daBa = 0. (20) 

Moreover, 

eabcBbVc - 0, (21) 

thus V is parallel to B. In these formulae, we have lowered indices using the 
Kronecker delta, 6ab- Now, elementary arguments using the divergence theorem 
establish the point that the flux through some fixed surface is unchanged if the 
surface is deformed while keeping its boundary fixed, andunchanged if the sur- 
face is propagated forwards with the flow. Note that we have introduced 5ab and 
eabc, and are only dualizing only for convenience: we are merely implementing 
the standard rules of exterior calculus in a particular coordinate system and 
the key results, that the fiux is conserved and that the fiow lies parallel to the 
magnetic field are independent of the coordinate system and do not require a 
metric. 

The argument easily generalizes to higher dimensions, with the magnetic 
field constructed by dualizing the (n— l)'th power of the Hamiltonian symplectic 
form. Let ^ 

Ba = 2n-'^{n ^^^abcdc.ght^bc^de ■ ■ ■ ^gh- (22) 

Then 

daBa = (23) 

follows from the closure of lo and 

^^[bc^de ■ ■ ■ ^gh] = £abcde...ghBa (24) 
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and so 

£abcde...ghBaVb ~ 0, (25) 

which impUes that 

BiaVb] = 0, (26) 

that is, Va is parallel to Ba- 

We end this section with a final point. Since u; is closed, it follows ujc is 
also closed on C and that the measure on the multiverse Q]\{ cx uj^~^ is also 
closed. Hence one can always locally write = dA, with A some n — 2-form, 
arbitrary up to "gauge transformations" A ^ A + dx- However, the canonical 
construction gives more than this: lu may be written as d{pidq') globally. So 
there is a natural definition of the "vector potential" A, allowing one to reduce 
the n— 2-dimensional integral of Qm over some surface S to an n — 3-dimensional 
integral of A over the boundary of S. The latter integral may then be regarded 
as giving the integrated probability measure for all trajectories passing through 
5* or any topologically equivalent surface with the same boundary. 



3 Gravity and a Scalar Field 

In this section we consider a single minimally-coupled scalar field </> with po- 
tential V{(j)) in a homogeneous and isotropic (FRW) universe. Generalizing the 
discussion to to additional scalar fields and other fields and fluids, or anisotropic 
cosmologies should be straightforward [20l [21] . But the simplest case is inter- 
esting enough that we shall devote the remainder of this paper to it. A version 
of this model with two scalar fields, and its statistical properties, was analyzed 
by Starobinsky from a different point of view With the above-mentioned 
symmetry restrictions, the line element is 

- N^df + a'^{t)-/,jdx'dx^, (27) 

where "fij is a metric on a space of constant (three-dimensional) scalar curvature 
/c = or ±1. Choosing units in which Mp^ = 1/(8ttG) = 1, the Einstein-scalar 
action is 

S = J dtN (^-3a{N-^a'^ - k) + ia^iV^^^'^ - a^V{(j))^ , (28) 

where primes denote t derivatives. 

Varying the action with respect to the lapse function TV yields the usual 
Friedmann equation 

where dots denote proper time derivatives, with dr = Ndt, and H = d/a is 
the expansion rate or Hubble parameter. Varying with respect to (j) yields the 
scalar field equation 

4> + 3H^ = -~V^. (30) 
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Taking the time derivative of ([^5]) and using ([50]) then yields 

Finally, varying with respect to a yields a linear combination of (|29p and ([?T|) . 
Equation ([?T|) will be of particular interest to our later discussion. For fc < 0, 
the Hubble parameter H never increases, so no classical trajectory can cross 
a 7?=constant hypersurface more than once. Likewise, from ([29|) . if V is non- 
negative, then for k < 0, H can never change sign. This implies that no classical 
trajectory can cross an a = constant surface more than once. 

We now turn to the Hamiltonian analysis. The canonical momenta conjugate 
to a, (f) and N are 

Pa = —6aa — —6a^H, = a'^(j), pN = 0, (32) 

and the Hamiltonian is 

'H^N(^^ + \^^ + a^V{<P)^iak\, (33) 
\ 12a 2 a-^ / 

which vanishes by the equation of motion for p^v- We can use this to elimi- 
nate one of the four canonical variables a,pa, '/'iP^- Since the Hamiltonian is 
quadratic in both pa and p^, either may be easily eliminated. Both choices have 
some merit, as we shall discuss below. 

The first choice, used in previous treatments, is to eliminate pa- This has 
the advantage that it is easily generalized to many matter fields, whether or not 
they have minimal kinetic terms. It proceeds as follows. From the vanishing of 
(|33|) we obtain 

Pa = ±y^6p2a-2 + l2a4F(0) - ma?k. (34) 

The choice of sign will occur in many subsequent formulae, but leads to no 
ambiguity. All it means is that at generic values of the remaining variables 
a, (/) and p^, there are two possible solutions, representing an expanding or a 
contracting universe. 

It is now convenient to change non-canonical coordinates to (/>, (j) and A = In a. 
The velocity of the flow in these coordinates is easily obtained: 

= (0,-3i/0-F^,i/) . (35) 
The canonical two-form on the constraint manifold C, 

u)c = {dpa ^da + dp^ Kd(l))\-H=Q- (36) 
is straightforwardly evaluated in the same coordinates, 

ujc = e^^ (~i(j)/H)d(j) AdX + (3</) -I- V^/H)dX hd(j) + d^h dcjj) . (37) 
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Dualizing, we now construct the magnetic field Ba = ^^abci^bc, 

(B^, B ; , Ba) = e^\-^/H, + V,^/H, -1), (38) 



where H = ±^ ^ (^"^^ + ~ ke^"^^. One can check that Ba is divergence- 

free, d^Bij, + d^B^ + dxB\ = 0. As explained at the end of the last section, the 
magnetic field is naturally expressed as the curl of a vector potential A = pidq^ . 
Explicitly, we have 

(^^,^^,AA) = e3^(0,O,-6iI), (39) 

with H expressed in terms of 4>, (p and A as above. 

For the purposes of our discussion, however, a more illuminating choice is 
to eliminate p^. This choice is nice because now two of the remaining three 
variables a and H have monotonic properties for fc < making it simple to 
construct a surface S which every classical trajectory crosses only once. From 
the Hamiltonian constraint 7Y we obtain 



= ±^J^pla^ - 2a6l/(0) + 6a4fc. (40) 

The square root again causes no problems: it just means the scalar field can 
generically have a positive, or negative velocity at each point {(j), H, A) in the 
classically allowed domain. In fact, for this choice of variables and for simple 
potentials which rise monotonically from a single minimum (like cj)'^ or 0'*), we 
find from (|31|) that for each H and A the range of (p is bounded by the two roots 
of the equation 

V{(l)) ^3{H^ + ke-^^). (41) 
In the coordinates (p, H and A, the velocity of the flow is: 



= (0, iJ, A) = (±^/m'^ -2V + Qk/a^, V - ~ 2fc/a^ Hj , (42) 
and the canonical two-form (1361) is 



u;c = e'\-6dH A dA ± A d(f> 

± , dHAd(j)). (43) 

Dualizing, the magnetic field is 

B.,i..)^6e3Vl,±l^^^-^^ + ^^-"", ^ ),(44) 



which is parallel to the flow ([3T|) and, again, divergence-free. The natural vector 
potential A = Pidq^ is found to be 

(^0, Ah, Ax) = e^-^ (±V6H^ + 6fce-2^, Q, -6ff) . (45) 
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The magnetic field we have calculated describes the flux per phase space 
area of all classical trajectories. All that remains is to choose a suitable surface 
S which the trajectories each cross once. As mentioned following equation 
for fc < 0, the Hubble parameter H is monotonically decreasing. Hence it 
makes sense to slice phase space on a surface H = 7J5=constant (or some mild 
deformation thereof). To compute the flux through the surface we must then 
integrate over the directions lying within the surface, parameterized by (j) 
and A, obtaining [52] 



The integral converges for negative A but diverges for large positive A: this is 
essentially the infinity identified by Hawking and Page. However, a key point is 
that in the limit of large A, the universe becomes spatially flat. In this limit, the 
value of A is neither geometrically meaningful nor physically observable. Our 
proposal for dealing with this physical degeneracy of solutions is to identify all 
universes which are indistinguishable on the chosen H = Hs slice. The natural 
dimensionless, geometrical measure of the curvature of space is the ratio of the 
space curvature term to the Hubble parameter term in the Friedmann equation, 
fife = —ke^'^^^/H'g. Our proposal is to identify universes for which \Vlk\ is 
smaller than some bound Afife. In doing so, we collapse the integral over large 
values of A, effectively introducing a cutoff in A, given by e^'*''"''^ — l/(AfifciJ|). 
As we now show, if Afife is low enough and Hs small enough that the expansion 
of the universe is adiabatic as far as the matter fields are concerned, then as far 
as some predictions are concerned, the cutoff dependence disappears from the 
result. 

We are left with an integral over a two-dimensional surface with a boundary 
A = constant. Stokes' theorem, / B • dS = ^ A • dl, along with equations (|45|) . 
([29|). ((32|) and a — e/^ , further reduce the measure to 



which is nothing but the standard expression for the adiabatic invariant [23j 

for a homogeneous field (j){t) evolving in a time-dependent background a{t). 
The result (|T7|) could have been anticipated at the outset by expressing u) = 
d{pada -\- P(j,d4>), and noting that the first term does not contribute when a = 
constant. It was necessary, however, to work through the intermediate steps as 
we have done in order to explicitly study the integral over A and show there is 
no additional surface term. Note that the a? factor in (j47|) is essential to the 
derived measure being conserved. One may, in the fc = case, eliminate H from 
((29)) using (jSO]) and thereby obtain a closed second order differential equation 
for (j). This gives an autonomous first order system in the ((/i, (j)) plane. However, 
this is not a Hamiltonian system as can be seen from the fact that it possesses 
an attractive fixed point. Therefore one may not use the simple measure d(j)d(j). 




(46) 




(47) 
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Now, let us justify our claim that any dependence on Afi^ disappears from 
the probability distribution for certain quantities, provided Hg is taken suffi- 
ciently low that the expansion of the universe is adiabatic as far as the matter 
fields are concerned. The point is that in this regime the scalar field oscillates 
rapidly, with frequency m ^ H, and its stress energy is accurately described 
by a perfect fluid with zero pressure. The evolution of the scale factor a{t) then 
becomes a background function of time, independent of the phase of the scalar 
field oscillation. Wc can then define the Hamiltonian for the matter field alone, 

2 

= + (48) 

in which a{t) is treated as a background variable. When the scalar field is 
oscillating about the minimum of V{<j))^ we can approximate the potential as 
^rri^cfp . Using this we can calculate the adiabatic invariant (|47p. obtaining 

= = 27rJ. (49) 

The "angle" variable canonically conjugate to J is then found to be 

= tan-if^^), (50) 

which obeys the following equation of motion: 

^ = -TO- --sin26l. (51) 
2 a 

In the regime of low H where the the canonical measure reduces to an 
adiabatic invariant, it becomes independent of 6. (A similar conclusion about 
the behaviour of the measure at late times was reached by Starobinsky in his 
early work on inflation-like models JT].) It follows that the canonical measure 
for loses its cutoff dependence in this regime. As we shall show in detail in 
the next section, for a universe which has undergone a substantial amount of 
inflation, 6 must necessarily lie in an exponentially narrow range. Once inflation 
is over, and the expansion of the universe may be treated as adiabatic, from (|5ip 
one can show that the separation of two nearby trajectories in 9 rapidly tends 
to a constant. Hence if we estimate the range of 9 corresponding to N e- folds 
of inflation, near the end of inflation, this estimate will remain valid for as long 
as the expansion (or contraction) of the universe may be treated as adiabatic. 
This justifies our claim that our calculated probability for N e-folds of inflation 
becomes independent of the cutoff Ailfc in the late universe. 

In contrast, the canonical measure for the scale factor a (or equivalently fife) 
is strongly cutoff dependent. To see this, note that up to a constant, J is just 
Pij,a^ where p^f, is the effective density of matter contributed by the scalar field 
oscillations. Defining Q.^ — p^/{iH'^) = 1 — 51^, Friedmann's equation (|29p 
yields 
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Figure 1: Set of classical trajectories for the inflationary model with V — ^w^cjp' 
and fc = 0, in the coordinates (0, 0, A), where A = Ina. The upper panel shows 
the projection onto the {(p, (p) plane, and the lower panel shows the projection 
onto the 0, A) plane. The dashed lines indicate the projection of the measure 
surface 5, which takes the form of an elliptical cylinder and which each trajec- 
tory crosses once. The parameters used were m? = 0.05, Hs ~ 0.1. 
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which, for fixed H = Hs, tends to unity for large a. The canonical measure, 
d{p^a^) then yields the probability distribution 

where amin — 0, Hg^ for k = +1, —1 and the upper cutofl^ Oc = Hg^{Ail,k)~^ ■ 
The resulting distribution favours flat universes, as Hawking and Page pointed 
out [12], but the result is dominated by the cutoff Ailfc. Note, however, that from 
our point of view we are not using the canonical measure as a physical theory 
which makes predictions, rather we are using it as a framework for assessing 
how predictive theories are. What we can conclude, however, is that from this 
point of view, a flat universe is not so surprising. 

Let us illustrate these remarks with the simple inflationary model Vi^cj)) = 
^rn?(jp , with m <C 1 in Planck units. We start with the spatially flat case, fc = 0. 
Figure ([T]) shows the evolution in the first set of variables, 0, 0, A. The upper 
panel is the classic plot |241 [55] showing the attractor behavior in the (0, 0) 
plane. The lower panel shows the projection onto 4>, A, showing how the scale 
factor grows in the various solutions. As the set of trajectories rise in A, they 
become more and more tightly twisted about the A axis. Notice in particular 
how all the solutions which have infiated for an extended period converge on a 
particular late-time oscillation phase. The dashed lines show a suitable measure 
surface S: the surface H = Hs= constant is an ellipse in the 0, plane, and 
a cylinder in the full three dimensional space. Our prescription amounts to 
cutting the cylinder on a surface A = constant, and integrating over the flux 
entering the cylinder below the cut. Since the magnetic field is divergence- free, 
this flux equals that leaving the cylinder on its upper cut surface A — constant. 
The total flux is given by a line integral of the vector potential around the 
ellipse. We have chosen Hs to be the Hubble constant near the end of inflation, 
when the slow roll approximation fails. As argued above, the resulting measure 
on the phase of the scalar field oscillation 6 becomes independent of the cutoff 
as Hs is reduced to still lower values. 

Figure [2] shows the trajectories in the (0, H) plane. As time runs forward 
from S in an expanding universe (blue curves), all the trajectories run down 
in H and bounce back and forth between the two boundaries H = ±m0/\/6 
an infinite number of times. Running time backwards from S (red curves), the 
trajectories run to higher H and generically end up kinetic-dominated. We will 
quantify this statement precisely in the next section. 

4 Slow-roll inflation 

We would like now to analytically estimate the measure ([TT]) in models of slow- 
roll inflation. Slow-roll inflation requires that the derivatives of V{(j)) are small: 
for fc = 0, the equations simplify and may be solved analytically as an expansion 
in derivatives of F(i^), as follows. From equations ([3T]) and (|43]) . with fc = 0, we 
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Figure 2: The set of trajectories in the (0, H) plane for V{(j)) = ^to^(/>^ and 
fc = 0, and the same parameters as in Fig. 1. The measure surface S is taken at 
H = 0.1, and the trajectories plotted are equally spaced in cj) on that surface. 
Only the trajectories with positive cj) on the measure surface are shown: those 
with negative (j) are obtained by mirror reflection. 



find 

Slow-roll inflation requires the dominance of the first term on the right hand 
side. Provided we have a potential with a minimum and no local maximum, we 
can just iterate this equation to find the slow-roll solution which inflates all the 
way back to the Planck scale. We find this solution to be 



HsrU) = \/-(l + — (— )^ + (-13(^)*-H16(- , 

+ 3l56^2^^(^)' - 432(^)^^ + 160(^f + 64^^) + . . .) (55) 

where we have implicitly assumed infiation rather than deflation. From the 
Friedmann equation (j3ip one sees that the sum of the sub-leading terms in this 
formula are due to the kinetic energy of the scalar field: one has 4> = 2{dH/d(j)). 

The inflationary trajectory (|55p is just one solution of the theory, and it has 
measure zero. What we are interested in is counting all of the solutions, and 
assessing how much inflation occurs in each one. This is very straightforward 
to do. It is convenient to take Hs to be the value of the Hubble constant near 
the end of inflation, just before the slow-roll approximation breaks down. The 
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measure ([5S)) then takes the simple form 





dH 


1 M(j)a^ 


dcj) 



(56) 

where the slope is calculated for each trajectory crossing Hs and the limits of 
the (j) integral are given by equation (f4T|) . 

Let us now calculate the number of e-foldings of expansion which have oc- 
curred as we trace the field evolution back in time, with the scalar field rolling 
back up the hill to some earlier value (j) which we shall take to be positive. The 
number of e-foldings is just N = J drH . From ([54)1 and the Friedmann equation 
([3T|) with fc = 0, we find 

— = ^ (57) 

which is an exact equation and docs not assume the slow-roll condition. Second, 
let us consider perturbing around the slow-roll solution to ([54|). given in ([55]) . 
Setting H Hsr + SH, we find to first order 

^ - ^^^^ - (58) 

or 

This remarkably simple equation is again exact, and valid for any inflationary 
potential V{4)). (Similar equations were derived, for example, in Ref. [26].) As 
we track the solution back in time, the deviation of H from the slow-roll so- 
lution grows with the number of e-foldings as exp(3A^). When the deviation 
in H becomes large, the solution departs from slow-roll. This occurs when 5H 
becomes of the same order as the first correction in ([55]). i.e., when the kinetic 
energy becomes significantly larger than that in the slow-roll solution. If 5H 
is positive, as we follow the solution back in time, the kinetic energy blueshifts 
and quickly overwhelms the potential energy so the solution traces back to a 
kinetic-dominated solution with (j) diverging to -|-oo. If 5H is negative, the ki- 
netic energy falls away to zero and the scalar field motion is turned around by 
the sloping potential. Again, tracking the solution back in time, the solution be- 
comes kinetic-dominated and the scalar field diverges to — oo. Slow-roll inflation 
occurs on the boundary between these two behaviors. 
The condition for slow-roll to break down is 

SH^SHse''^ t^Ih-J^] {N) = C{N), (60) 




where SHs is the deviation evaluated on our measure surface S, near the end 
of inflation, and C{N) is a relatively weak function of N. 

For large N our perturbed solution is very close to the slow-roll solution on 
S, and we can perform the integral (j56p to obtain the integrated probability for 
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Figure 3: Illustration of the tuning needed to create the inflationary solution as 
one follows (f> backward in time from a measure surface H = Hs- Trajectories 
which (p is too large on the measure surface encounter the classical turning 
point H = y^V{(p)/3, turn around and end up kinetic dominated. Trajectories 
in which the initial (f> is too small fall behind the inflationary solution and also 
head towards kinetic domination. 

N or more e-folds of inflation, 

This is the main result of this section. For example, if = r7i^0^/2, from 
dSil) we find HsR ~ (m0/V6)(l + 1/(3^2) _^ _ ^nd from N « (f>^/4. 

Hence C{N) « m/\/]V(l + 0{1/N)) at large N. We take the measure surface 
to be at the value of H where the slow roll approximation fails: from ([55|) we 
see this condition reads </) « 1 in Planck units, which leads to H ^ m. The 
normalization factor M ^ m and wc obtain P{N) « 7V~ 2exp(— 3A^), up to a 
numerical factor. 

Figure [3] shows the tuning needed to obtain a large number of inflationary 
e-folds. If we follow a trajectory back in time, to higher values of H , then 
the inflationary trajectory nestles close to the classical boundary H = jZ. 
The corresponding initial value of on the surface H ~ Hs is given by ([55|) . 
The series converges rather slowly but a very precise value is easily obtained by 
a numerical "shooting" procedure. If the initial (j) is greater than the critical 
value, then the trajectory hits the boundary, (j) reverses direction and ends up 
in kinetic domination, with H cx cxp(— ■\/3/2 </>). If, on the contrary, the initial 
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Figure 4: Same as Figure 2, but for an open universe with flk = 0.25 on the 
measure surface H = Hs- 

(j) is smaller than the critical value, the trajectory diverges from the boundary 
and becomes kinetic dominated with H oc exp(-y/3/2 (f). 

5 Curved Universes 

In the last section, we focused on flat universes because we found that the canon- 
ical measure possesses a divergence in the flat limit. Many of our arguments 
apply equally well for negatively curved universes (fc = —1), but for positively 
curved universes the good monotone properties of H and a do not persist. Nev- 
ertheless, for the purposes of our discussion all that we really need is that the 
spatial curvature has a modest effect when is small. This is obviously true 
but for the purposes of completeness, we exhibit the detailed behavior of the 
classical trajectories in the (0, H) plane for nonzero k. Figure ^ shows the 
negatively curved case, with a modest but non-negligible value of the the spa- 
tial curvature at the measure surface, and Figure ([5]) shows a similar picture 
for a positively curved universe. In both cases, it is apparent that the spatial 
curvature makes little difference to the kinetic-dominated trajectories when the 
Hubble parameter is large, since the scalar kinetic energy scales as compared 
to a^^ for the space curvature term. At lower values of H , the spatial curvature 
quickly dominates over the oscillations of the matter field, whose density scales 
as . Notice the non-monotonic behavior of H in the positively curved case. 
This has been analyzed in detail by Hawking and Page [12] . 
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Figure 5: Same as Figure 2, but for a closed universe with flk = —0.25 on the 
measure surface H — Hs- 

6 Comparison with previous discussions 

The most recent pubhshed discussion of these issues is in the papers of HoUands 
and Wald [T71 [TH] , and the response by Kofman, Linde and Mukhanov [27] . As 
we have already mentioned, because our measure is time reversal invariant, it 
does support Hollands and Wald's claim that deflation should be equally likely to 
inflation. As we have already mentioned, Hollands and Wald's discussion seems 
to criticize the canonical measure iljv/ on the grounds that it is hypersurface- 
dependent but as we have discussed in detail, this criticism does not apply to 
our situation, in which the surface H = Hs = constant is crossed once and only 
once by each classical trajectory. 

Kofman et al. argue that inflationary dynamics is "definitely not" measure- 
preserving because energy and entropy are created during inflation. However, 
the relevant energy is, for us, the total Hamiltonian 7i, which is always zero. 
Furthermore, the microscopic dynamics is Hamiltonian, and if one starts in a 
definite initial state, the fine-grained entropy is always zero. It is true that 
inflation is a non-adiabatic process, generating coarse-grained entropy. But the 
microscopic dynamics is measure-preserving and time-reversal invariant in that 
its definition, and indeed the definition of the multiverse M, does not pick out 
a particular direction of time. 

More specifically, Kofman et al. assume "chaotic" initial conditions in which 
a closed universe is born with roughly Planckian energy density in the scalar 
field kinetic energy, gradient energy and potential energy. The assumption of 
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equipartition seems to us hard to justify, since it is by no means clear that there 
has been time for any sort of equihbration process. Kofman et al. argue that a 
gradient- or kinetic-dominated closed universe would collapse in a Planck time 
and hence should be ignored. But, even if one insists upon considering a closed 
universe, as is well known, spatial inhomogeneities can allow an open universe 
to form within it via a process akin to bubble nucleation. Their argument 
does not exclude negatively curved, or nearly flat solutions which are kinetic- 
dominated at early times. In fact, we have found such solutions to dominate the 
canonical measure. Finally, Kofman et al. quote a formula for the probability 
for creation of a universe "from nothing" which involves an ad hoc sign flip of 
the usual formula for the Euclidean action. According to the usual formula, the 
most probable universe in fact has the lowest allowable value of V{4>), not the 
highest. So this argument again seems unconvincing to us. 

Finally, Kofman et al discuss the "attractor" behavior illustrated in the 
upper plot of Fig. [TJ They give an ad hoc measure which assumes Planckian 
initial energy density and a uniform distribution for the angle 9 between the 
initial value of and the (j) axis. They argue that all but 1 — 0{m) of the 

trajectories undergo inflation from the Planck era. Since m is of order 10^® in 
realistic models, they argue that inflation is virtually inevitable. However, they 
have omitted in their measure the factor in (|Tf)) . and this means that the 
probability of inflation they estimate will depend very sensitively on which circle 
centred on the origin in (</), (jj) space one chooses to evaluate it. Kofman et al. 
also consider imposing their measure at the end of inflation, and conclude that 
the probability is suppressed by 0{m). Hence they claim that even if one uses 
the late time measure, the probability of inflation is O(10~^). This estimate 
does not agree with our calculations. 

As we have seen, the probability of N e-folds of inflation in the canonical 
measure is roughly exp(— 3A^), independent of the cutoff AOfc or the value of 
Hs- For 60 e-folds, a typical minimal number for realistic models, we find a 
probability of approximately exp(— 180). In the m'^ip'^ model, he probability 
of inflation running all the way back to the Planck density is much smaller 
than this, approximately exp(— m~^) « exp(— 10^^)! This may be compared 
to the probability one obtains from quantum cosmology [14j if one adopts the 
usual sign for the Euclidean action, i.e., with probability oc exp(— S'^;) and 
Se ~ —247r^/y ((/)). Comparing a Planck-scale instanton, with (/> ^ m^^, with 
an instanton yielding only one e-fold of inflation, with 0^1, one flnds that the 
probability of Planck-scale inflation is ~ exp(— m~^), which is parametrically of 
the same order as our classical result. 

7 Conclusions 

We have developed the canonical measure for homogeneous, isotropic universes 
and, we hope, clarified its geometrical structure. We applied this measure to 
simple inflationary models and found it to be divergent, in agreement with 
Hawking and Page. However, we identified the source of the divergence as being 
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due to the geometrical and physical degeneracy of solutions which are spatially 
very flat, i.e., for which \ flk\ <C 1. Our proposal for removing the divergence is to 
identify universes which cannot be observationally distinguished, i.e., for which 
\^k \ is smaller than a critical value. For this purpose, it was convenient to work 
in coordinates </>, H and a in which the limit is easily imposed. The resulting 
integral over the scale factor a may then be performed, leading to an integral 
over the matter variables alone. We have shown that the resulting measure, 
when evaluated for low Hubble constant Hs on the measure surface, reduces 
to an adiabatic invariant for the matter fields. In particular, it is independent 
of the conjugate "angle" variable. It is this "angle" variable which becomes 
exponentially focussed during inflation. Hence we find that the probability of 
inflation, evaluated with the canonical measure, is exponentially small, and this 
result is very insensitive to the cutoff. 

Had we instead chosen to implement our prescription at a very large value of 
Hs, one for which many e- folds of inflation are possible, we would have found, 
on the contrary, a strong dependence of the results on the cutoff parameter Aflk. 
One could choose to live with this conclusion, and indeed if Hs is chosen large 
enough, and AClk small enough, one would find that most classical trajectories 
inflate. But there arc two reasons for being suspicious about the conclusion. 
First, the inferred probability measure will be strongly cutoff-dependent, allow- 
ing no firm conclusions to be drawn. Second, we find it hard to justify why, 
if one adopts a time-dependent measure, the decision as to whether universes 
are or are not geometrically distinct should be made at the Planck time, rather 
than today. These arguments suggest that some new ingredient or dynamical 
principle is needed in the theory, in order to explain why inflation began. 

On the other hand, in an anthropic, or "top down" approach to cosmology, 
the idea is to select universes on the basis of what they would be like at low 
H, not at their beginning. In that case, it seems clear that the appropriate 
measure should be a late-time measure like the one we have used, not an early 
one. The same statement would apply to any approach based on computing the 
probability of asymptotic "out" states. 

Even though we have argued that the measure we have developed is reason- 
able, and fulfils the conditions originally proposed by Gibbons, Hawking and 
Stewart, we have no argument that it is unique or even physically relevant. It 
seems to be a perfectly satisfactory a priori measure, that is, an unbiased es- 
timator of our ignorance. However, its status as such is rather different from 
probability distributions which have arisen as a result of some equilibration pro- 
cess. In our case, there is no obvious physical mechanism which allows different 
members of the ensemble to interact, and without such interactions, the notion 
of equilibration is not relevant. What the canonical measure does is allow one 
to discuss in a quantitative way how different proposals for the big bang, or the 
beginning of the universe, cut down the space of classical trajectories and hence 
make predictions about the state of the universe today. 

The fact that the canonical measure, with what seems a sensible resolution 
of its divergence, strongly disfavors many e-folds of inflation, poses a serious 
puzzle for inflationary theory. It is important, we believe, for inflationary the- 
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ory to explain why the kinetic-dominated trajectories which we have found to 
overwhelmingly dominate the canonical measure are somehow excluded. Indeed, 
the measure we have calculated is generous to inflation in that we have assumed 
spatial homogeneity and isotropy, and the canonical measure gives equal weight 
to every distinct classical solution, even those with very high potential energy 
density early on. The main conclusion we draw from this work is that the ques- 
tion of why or how inflation started remains a deep mystery, and a challenge for 
fundamental theory. Until that question is answered, we should remain cautious 
about claiming that cosmology's classic puzzles are "solved" . 
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